Model Selection

8-bit quantization

# 8-bit quantization

ERNIE 4.5 21B A3B PT 8bit

ERNIE-4.5-21B-A3B-PT-8bit is an 8-bit quantized version of Baidu's ERNIE-4.5-21B-A3B-PT model, converted to MLX format and suitable for Apple Silicon devices.

Large Language Model Supports Multiple Languages

Jan-nano-8bit is an 8-bit quantized version converted from the Menlo/Jan-nano model, optimized for the MLX framework and suitable for text generation tasks.

Large Language Model

Josiefied DeepSeek R1 0528 Qwen3 8B Abliterated V1 8bit

This is an 8-bit quantized version in MLX format converted from the DeepSeek-R1-0528-Qwen3-8B model, suitable for text generation tasks.

Large Language Model

Deepseek R1 0528 Qwen3 8B MLX 8bit

An 8-bit quantized version based on the DeepSeek-R1-0528-Qwen3-8B model, optimized for Apple Silicon chips and suitable for text generation tasks.

Large Language Model

lmstudio-community

Devstral Small 2505 8bit

Devstral-Small-2505-8bit is an 8-bit quantized model converted from mistralai/Devstral-Small-2505, suitable for the MLX framework and supporting text generation tasks in multiple languages.

Large Language Model Supports Multiple Languages

Fastvlm 1.5B Stage3 MNN

FastVLM-1.5B-Stage3-MNN is a text generation model based on the Transformer architecture. It is an 8-bit quantized version of FastVLM-1.5B-Stage3, suitable for text generation scenarios such as chatting.

Large Language Model English

Spark TTS 0.5B 8bit

This is a text-to-speech model based on the MLX format, supporting both English and Chinese, converted from prince-canuma/Spark-TTS-0.5B.

Speech Synthesis Supports Multiple Languages

This is a text-to-speech model converted from sesame/csm-1b to MLX format, supporting the English language.

Speech Synthesis Supports Multiple Languages

Qwen3 235B A22B 8bit

This model is an 8-bit quantized version converted from Qwen/Qwen3-235B-A22B, suitable for text generation tasks.

Large Language Model

Orpheus 3b Korean FT Q8 0.gguf

Orpheus is a high-performance Korean text-to-speech model, fine-tuned for natural emotional speech synthesis, offering an 8-bit quantized version for optimized efficiency.

Speech Synthesis Supports Multiple Languages

Orpheus 3b German FT Q8 0.gguf

Orpheus is a high-performance German text-to-speech model, fine-tuned to achieve natural and emotionally rich speech synthesis. This model is an 8-bit quantized version of the 3-billion-parameter model, optimized for operational efficiency.

Speech Synthesis Supports Multiple Languages

Gemma 3 27b It Qat 8bit

Gemma 3 27B IT QAT 8bit is an MLX-format model converted from Google's Gemma 3 27B model, supporting image-to-text tasks.

Transformers Other

Orpheus 3b 0.1 Pretrained 8bit

This is an 8-bit quantized version of the Orpheus-3B pre-trained language model based on the MLX framework, originally developed by CanopyLabs.

Large Language Model English

Omnigen V1 Bnb 8bit

The 8-bit quantized version of OmniGen-v1, suitable for text-to-image and image-to-image tasks, supporting multimodal input.

This is the 8-bit quantized version of EleutherAI's GPT-J 6B parameter model, optimized for running and fine-tuning on limited GPU resources (e.g., Colab or 1080Ti).

Large Language Model

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase